AI029

Reinforcement Learning: An Introduction

Finite Markov Decision Processes

Lecture

Lesson 3

Date

2026-04-21

Teacher

AI Tutor

Duration

60 Mins

Learning Objectives

Define the agent-environment interface and the interaction loop.
Formally define Finite Markov Decision Processes (MDPs).
Understand the role of goals, rewards, and returns in task formulation.
Identify the significance of the Markov property in state representation.